A Model for Multimodal Representation and Inference
Abstract
In this paper some applications of a theory for representation and inference in multimodal scenarios are presented. The theory focuses on the relation between natural language and graphical expressions. A basic assumption is that graphical expressions belong to a language with well-defined syntax and semantics: a graphical language. A second assumption is that the relation between expressions of different modalities is similar to the relation of translation that holds between expressions of different natural languages. In this paper a multimodal system of representation and inference based on this view of modality is described. First, a brief introduction to the representational structures of the multimodal system is presented. Then, a number of multimodal inferences supported by the system are illustrated. These examples show how the multimodal system of representation can support the definition and use of graphical languages, perceptual inferences for problem-solving and interpretation of multimodal messages. Finally, the intuitive notion of modality underlying this research is discussed.

1. Multimodal Representation

The system of multimodal representation that is summarized in this paper is illustrated in Figure 1. The notion of modality on which the system is based is a representational notion: information conveyed in one particular modality is expressed in a representational language associated with the modality. Each modality in the system is captured through a particular language, and relations between expressions of different modalities are captured in terms of translation functions from basic and composite expressions of the source modality into expressions of the object modality. This view of multimodal representation and reasoning has been developed in [13], [17], [9], [18] and [19], and it follows closely the spirit of Montague’s general semiotic programme [5].
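The idea of a translation function defined over both basic and composite expressions can be sketched as follows. This is only a minimal illustration under stated assumptions, not the system described in the paper: the types `Basic` and `Composite`, the tables `LEXICON` and `OPERATORS`, and the example vocabulary (a city drawn as a dot inside a region) are all hypothetical.

```python
from dataclasses import dataclass
from typing import Tuple, Union


@dataclass(frozen=True)
class Basic:
    """A basic (lexical) expression of some language."""
    symbol: str


@dataclass(frozen=True)
class Composite:
    """A composite expression: an operator applied to sub-expressions."""
    operator: str
    parts: Tuple["Expr", ...]


Expr = Union[Basic, Composite]

# Hypothetical table: basic natural-language expressions mapped to
# basic expressions of an assumed graphical language.
LEXICON = {
    "Paris": Basic("dot_1"),
    "France": Basic("region_1"),
}

# Hypothetical table: natural-language operators mapped to
# graphical operators.
OPERATORS = {
    "is_in": "inside",
}


def translate(expr: Expr) -> Expr:
    """Translate a source expression into the object modality.

    Basic expressions are looked up in LEXICON; composite expressions
    are translated part by part, so the structure of the source
    expression is preserved in its translation.
    """
    if isinstance(expr, Basic):
        return LEXICON[expr.symbol]
    translated_parts = tuple(translate(part) for part in expr.parts)
    return Composite(OPERATORS[expr.operator], translated_parts)
```

Under these assumptions, translating `Composite("is_in", (Basic("Paris"), Basic("France")))` yields a graphical expression with the same shape, built from the operator `inside` and the symbols `dot_1` and `region_1`. A translation in the opposite direction would need its own pair of tables.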
The theory is aimed at defining natural language and graphical interactive computer systems and, as a consequence, the model focuses on these two modalities. However, the system is also used to express conceptual information in a logical fashion and, depending on the application, the circle labeled L might stand for first-order logic or any other symbolic language, as long as the syntax is well-defined and the language is given a model-theoretic semantic interpretation. The circles labeled L and G in Figure 1 stand for sets of expressions of the natural and graphical languages respectively, and the circle labeled P stands for the set of graphical symbols constituting the

1 To be published also in “Visual Representations and Interpretations”, Springer-Verlag, 1998.

FIGURE 1. Multimodal system of representation.
Publication date: 1998